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Financial and economic history is strewn with bubbles and crashes, booms and busts, crises and 
upheavals of all sorts. Understanding the origin of these events is arguably one of the most important 
problems in economic theory. In this paper, we review recent efforts to include heterogeneittes and 
interactions in models of decision. We argue that the Random Field Ising model (rfim) indeed 
provides a unifying framework to account for many collective socio-economic phenomena that lead 
to sudden ruptures and crises. We discuss different models that can capture potentially destabilising 
■ self-referential feedback loops, induced either by herding, i.e. reference to the peers, or trending, i.e. 

reference to the past, and account for some of the phenomenology missing in the standard models. 
We discuss some empirically testable predictions of these models, for example robust signatures of 
RFlM-like herding effects, or the logarithmic decay of spatial correlations of voting patterns. One of 
the most striking result, inspired by statistical physics methods, is that Adam Smith's invisible hand 
can badly fail at solving simple coordination problems. We also insist on the issue of time-scales, 
that can be extremely long in some cases, and prevent socially optimal equilibria to be reached. 
As a theoretical challenge, the study of detailed-balance violating decision rules is needed to decide 
whether conclusions based on models obeying detailed-balance are indeed robust and generic. 
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I. INTRODUCTION 

Traditional economics treat the aggregate behaviour of a whole population through a "representative agent" ap- 
proach, where the heterogeneous preferences of individual agents are replaced by an average preference curve. Agents 
are supposed to determine their action in isolation, with no reference whatsoever to the decision of others; in other 
words, interactions between agents are neglected. The need to account for interactions seems however quite com- 
pelling: imitation and social pressure effects must be responsible for the appearance of trends, fads and fashi on or 



mass panic, that would be very difficult to understand if agents were really insensitive to the behaviour of others. [135 1 
Financial markets ~ and for that matter whole economies - are prone to crises and exhibit sudden discontinuities and 
crashes, even in the absence of any violent exogenous event. The idea that some crises are endogenously generated 
dates back to Keynes [l| and Minsky [2], with many more recent apostles - see e.g. 0-01 ■ Collective effects can be 
the key to the success of a brand, a iDOok or a movie fsj, or else of a new technology or a vaccination campaign. 
But they can also be detrimental and do lead to major catastrophes when imitation cascades are based on unreliable 
information or dangerous ideas, and when social pressure supersedes rational thinking. In any case, social imitation 
often leads to distortion and exaggeration, i.e. a decoupling between the cause and the effect, which in turn generates 
inequalities and concentration (or "condensation" ) phenomena, see sections IV D| IVII Al These effects may grow ever 
stronger in our Internet and social network dominated societies. 

From a theoretical point of view, interactions may indeed lead to an aggregate behaviour that is completely different 
from that implied by a representative agent approach. Catastrophic events and discontinuities can occur at the macro 
level, when the behaviour of independent agents would be perfectly smooth and featureless. Models of collective 
behaviour that illustrate those "surprises" on the way from micro to macro are of course familiar to statistical 
physicists - with Van der Waals and Weiss as early heroes Similar ideas started to develop in quantitative 
sociology in the seventies, with models introduced by Schelling (1971, 1973) To'] and Granovetter (1978) fu\ that 
are formally very close to spin models of magnets (see below, section HV A|) . In fact, the analogy between magnetic 
polarization and opinion polarization was noted and formalized slightly earlier by a physicist (Weidlich) in 1971 [l2l | 
who coined the word "Sociodynamics" , and slightly later by Galam, Gefen & Shapir (1982) [13l|, in a manifesto for 
"Sociophysics" . 

As it is well known, interactions can lead to multiple equilibria and ergodicity breaking, which in turn lead to 
very interesting effects like "punctuated equilibrium" , hysteresis (or path dependence) and aging. These theoretical 
scenarios offer enticing metaphors to understand economic systems as well, in particular the emergence of trust, 
norms and conventions. After all, a prosperous economy can only be achieved through efficient coordination and 
collaboration. Although anticipated by Keynes and pioneered by FoUmer in 1974 (again in the context of spin 
models! [l3|). these ideas only really permeated into economics in the early nineties, partly nurtured by the Santa 
Fe meetings "T/ie Economy As an Evolving Complex System" [l6|. As Kirman (1992-93) was forcefully arguing 
against the representative agent framework [17] and proposing his 'ant' recruitment model [18i] . Brock and Durlauf 
wrote a series of remarkable papers [19I - I2H using statistical mechanics methods to elicit the importance of social 
interaction s in economics, emphasizing that the utility or payoff an individual receives depends directly on t he ch oices 
of others. Il36l Although not yet fully mainstream, the field of "social dynamics" is now well appreciated jl37| (see 
the books |23l. [23| for enlightening discussions, and [2^, [2^ and refs. therein for another strand of literature on 
"herding" effects; see also [23|). In parallel, physicists have actively contributed to the field, with a variety of models 
for coordination, contagion, opinion formation, segregation, herding, etc. - see e.g. |28l43l| for recent overviews of 
these endeavours. 

The present paper is intended to be a personal, somewhat informal review, centred around the use of Random Field 
Ising models (rfim) as a generic model for crashes, opinion swings and discontinuities in a socio-economic context. 
The RFIM provides a unifying framework and an enticing "story" for many disparate collective phenomena, including 
overnight trust evaporation like after Lehm an's bankruptcy in 2008, or the persistence of cultural traits or habits, even 



when damaging for the community. [138| Interestingly, some of the predictions are robust and can be quantitatively 
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tested against data. We then discuss some theoretical hmitations of the simplest (mean-field) version of the model. 
Three issues are, in our mind, particularly relevant. One is the importance of history and memory in the preference of 
individual agents, in the absence of any interaction effects (this is called habit formation or "stickiness") - see section 
IVII Al Another is the use of decision making rules (or selection probabilities) that have become standard in the above 
cited literature, and that amount to assuming detailed-balance in the physics jargon. How much non detailed-balance 
rules would affect the qualitative features is very much an open problem (section TVI Bp . The third one concerns spatial 
effects, that should play a major role in some cases, as illustrated by election turnout statistics (section [V CI) . Finally, 
we cursorily discuss alternative (or rather, complementary) models of crashes and regime shifts based on feedback 
loops between past events and current decisions (sections IVII Bl IVII Cp . 

Many ideas in this paper have been "long and widely known" . However, they are only well known to those who 
know them well. My experience is that even simple ideas may have a hard time permeating into other communities 
(the adoption models presented below could even be used to explain why! js^). Sometimes, a detailed translation 
helps. I hope that the present paper can serve this purpose and be a useful read, in particular for students in physics 
and in economics interested by the issue of crises and collective phenomena. 

II. A GENERIC BINARY CHOICE MODEL: THE RANDOM FIELD ISING MODEL 

A. Set up of the model 

The so-called 'Random Field Ising Model' (rfim) was proposed to model hysteresis loops and other anomalous 
properties of disordered magnets (331 [s^]- The hysteresis loop problem is an example of a collective dynamics of 
flips (the individual magnetic spins) under the influence of a slowly evolving external solicitation. Depending on the 
parameters, the flips may take place smoothly, or organize in intermittent bursts, or 'avalanches', leading to a specific 
acoustic pattern called Barkhausen noise. But because the model is so generic, it can be used as a simplified model 
for a host of other physical situations as well such as earthquakes, fracture in disordered materials, failures in power 
grids, etc. These systems respond to the driving in a series of avalanches spanning a broad range of scales, which 
Sethna, Dahmen and Myers (2001) proposed to call "crackling noise" [ssj . 

As was originally proposed in |13l . |36| , the model can also be transposed to represent binary decision situations 
under social pressure, and influenced by some global information (such as the price of a product or the quality of 
a technology) o r hy Zeitgeist. This transposition was recently discussed in several socio-economical contexts in, e.g. 
[37l - l42j (see 43-'46] for variations on the theme). The model has a rich phenomenology. In particular, discontinuities 
appear in aggregate quantities when imitation effects exceed a certain threshold, even if the external solicitation varies 
smoothly with time. In this situation, the aggregate demand is a multi- valued function of the price (ssj . Below the 
threshold, on the other hand, the behaviour of "demand" , or of the average opinion, is smooth, but the natural trend 
can be substantially amplifled and accelerated by peer pressure, possibly leading to sudden crises. 

We will assume that each agent i is confronted to a binary choice, the outcome of which being noted Si — —1,1. 
This binary choice can be to vote or not to vote, or to vote yes or no in a referendum; it can be to buy or not to buy 
a certain good (38l - l40| . to evade tax or not, to clap or to stop clapping at the end of concerts, etc. [4l|. Textbook 
examples in sociology are to attend or not to attend a seminar [l^, to join or not to join a riot [ll| or a strike p^ . 
to go or not to go to El Farol crowded bar [H, Of course, examples can be multiplied ad libitum. We wish 
to add two more in view of their importance: one is to trust or not to trust. This encompasses a large number of 
situations that recently made the news: credit markets and credit ratings; trust in future economic prospects; trust 
in the validity of the models used to value derivative products [44]; trust in the value of money itself, etc. The second 
is adopting or not an environment friendly technology (i.e. solar heating) which is individually costly but collectively 
valuable. This is closely related to the famous "tragedy of the commons" [1^ wherein individual short term interest 
is in conflict with long term collective welfare. 

It is reasonable to assume that the decision of agent i depends on three distinct factors: 

• (i) his personal inclination, preference or "willingness to pay/adopt" , measured by a real variable fi e (— oo, -l-oo) 
which we take to be time independent (but see section IVH Ap and distributed according to a certain p.d.f /?(/). 
Large positive f's means a strong a priori tendency to decide 5 = 1, and large negative /'s a strong bias towards 
S = -l. 

• (ii) public information, affecting all agents equally, such as objective informations on the scope of the vote, the 
price of the product agents want to buy, the balance sheet of the banks, the advance of technology, etc. The 
influence of this time dependent common factor, will be called the incentive held F{t), again a real variable in 
(— oo, +oo). 
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• (iii) social pressure or imitation effects; each agent i is influenced by the previous decision made by a certain 
number of other agents j in his "neighbourhood", V^. The influence of j on i is taken to be JijSj, that adds to 
fi and F. If > 0, the decision of agent j to buy (say) reinforces the attractiveness of the product for agent 
i, who is now more likely to buy. This reinforcing effect can obviously lead to an unstable feedback loop, as 
discussed in section HVBI If on the contrary Jij < 0, the action of agent j deters agent i from making the same 
choice. This "anti-conformist" or contrarian tendency, although rarer in human nature, can sometimes exist 
and be relevant, but this will not be pursued further here. 



B. Decision rules 



In summary, the overall perceived incentive Ui of agent i to choose +1 over —1 is 

U^{t) =f,+ F{t) + J^lSJ{t - 1). (1) 

Now one has to specify a decision rule, given a certain value of the total incentive. The simplest rule is: 

S,{t)^sign[U,{t)-Uth], (2) 

meaning that the decision to "buy" is reached whenever the incentive Ui reaches a certain threshold value Uth, which 
can be chosen without loss of generality to be zero. |l39j | 

However, the above decision rule is perhaps too deterministic and restrictive. One may imagine that agent i could 
not pay attention to or misread the public information (which is often intrinsically ambiguous and hard to interpret) 
or else have a random, time dependent component to his preference (a sophisticated way of saying people can be 
fickle). [l40j | A more general rule that is a standard in the choice/decision making literature (see section FVl Ap is the 
so-called "logit rule" or "quantal response" which makes the decision a random variable, with probability: 

P(5, = +l;C/,) = ^^^3^; P(5. = -l;C/0 = l-P(^,: = +l;[/^), (3) 

where /3 is a parameter that specifies the amount of noise (or "irrationality") in the decision process, and is the 
analogue of inverse temperature in physics. When /3 — ^ 0, incentives play no role and there is a random 1/2, 1/2 
probability of the two decisions, whereas when /3 — >■ oo, one recovers the deterministic rule given by Eq. 

It is in fact very natural to generalize slightly the above rule to keep some memory of the previous decision and 
allow for possibly time dependent incentives: 

P{S.Xt) = +l\S,{t - 1) = -1; U.,) = Y^-^; P{S^{t) - -mit - 1) = -1; Ui) = l- Y^-^, (4) 

and similarly when all signs are flipped, with Ui — >■ ~Ui. The coefficient fi < I can be interpreted as a probability to 
reconsider one's past choice between t — 1 and t. When /i = 1, all memory is lost and one recovers the previous rule 
Eq. dni). 

These decision rules might at this stage appear as somewhat arbitrary, and guided by the fact that they will allow 
one to use directly Boltzmann-Gibbs measure and use results from statistical mechanics. We will comment on those 
rules below (sections I VI AlVIBj) - although considerable effort has been devoted to axiomatize decision theory [i^ , 
we feel that there is indeed some leeway here that might lead to interesting problems for future research. 



C. The mean- field limit 



If all Jij > 0, the above model is known in physics as the Random Field Ising model at temperature T = 1//3, 
and has been intensely studied in the last decades, in particular at T = (see e.g. [H, [13, HO] ) ■ In physics, natural 
networks of connections are d-dimensional regular lattices, but other topologies, more natural in a socio-economic 
context such as the fully connected case, regular trees or random graphs have been studied as well. The qualitative 
phenomenology however does not depend much on the chosen topology, nor on the distribution of idiosyncratic fields 
p{f), although quantitative details might be sensitive to these specifications. For simplicity, and in line with many 
previous papers [H, [H, [U, [38l - l4li [sif we will thus mostly restrict here to the "mean-field" case where Jij = J/N, 
pairs. This does not mean that each agent consults all the other ones before making his mind, but rather than 
the average opinion m = Si, or total demand, becomes public information, and influences the behaviour of 
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each individual agent. This is in fact a very reahstic assumption in many cases: for example, the total sales of a 
book, or number of viewers of a movie, is certainly an important piece of information for the consumers. It is also 
thought that the evolution of the public opinion can be affected by polls or by confidence index, i.e. by a proxy of 
the average opinion [52|-[53|. In the case of financial markets, the price change itself can be seen, on a coarse-grained 
time scale, as an indicator of the aggregate demand (although the detailed relationship between the two might be 
quite subtle, see [sH. Isg!]). While this mean- field description may be very reasonable in certain settings, one expects 
that network/local effects may be relevant in other cases, for which the neighbourhood of agents is a more specific 
community. This locality might actually change quite a bit the features of the model (see section HVB I) . 

We now turn to an analysis of the RFIM model in different limits, and try to summarize the salient results and their 
interpretation in a socio-economic context. 



III. THE CASE OF AN HOMOGENEOUS POPULATION: THE ISING-WEIDLICH MODEL 

A. Dynamical equations and equilibrium 

Let us start by analysing the case where fi = /, V«. Up to a shift in the public information field F, one can always 
set f — 0. There are ^ 1 agents, and at time t, the fraction of them who have chosen the S* = -f 1 alternative is 
(j) — N-^-/N. In the mean-field limit, the total incentive to choose -1-1 over —1 is simply: 

U = -C + 2J(t), C = J-F (5) 

If we speak of adopting individual solar heating, for example, C can be interpreted as the immediate cost for installing 
a new heating system, whereas the term 2 J(/> encapsulates both the expected cost decrease and technical improvements 
as more people shift to the new technology, and the social pressure that makes it "politically incorrect" to stick with 
hydrocarbon greedy devices. 

Now, we use the dynamical update rules given by Eqs. ^ above, and set z — e^^ . It is e asy to show that the 
following probabilities hold for evolution of the number of 'adopters', iV+ between t and t + 1: |14l| 

P{N+ ^ iV+ + l) = ^-^(1-0); P{N+ ^ iV+-l) = /i-^<^; P{N+ ^ N+) = l^J^[z + cj>{l-z)]. (6) 
I + z 1 + z 1 + z 

From the above, one gets the evolution of the average and the square-average (N^): 

{N+)t+i-{N+)t=fi[^-4,]; (7) 

and 

{Nlh+, {Nl), = 2^(A^+).[-^ ~^]+ /.^i-^^ (8) 

1 + z 1 + z 

These equations were obtained in the present sociological context by Weidlich [5l[ , but are of course very standard 
in the context of the mean-field Ising model. 

Taking /x = dt infinitesimally small for convenience, and comparing with the predictions of the following Ito 
stochastic differential equation: 

diV+ = J-{(j))dt + S(0)dl^ (9) 
(where dW is the usual Brownian noise) one finds, by identification: 

Ti<P) = -r^-^; E^(^).ii-^ (10) 

1 + z 1 + z 

In order for this equation to be well behaved in the large N limit, a change of time scale must be performed ast ~ t/N 
(meaning that over a finite change of t, a finite fraction of the population has been offered the possibility to switch). 
This finally leads to: 

d(j) = T{(j))At+^^AW. (11) 



From now on, we will drop the tilde. Note that the scale of the noise goes down as l/VW with the population size. 
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The "equilibrium" state(s) of the system are such that {d(f)) = 0, i.e. 0* such that F{<jf) = 0, leading to: 



z 



1 + z* 



^* ^^Pi-c+2jr)^ (12) 



Setting m = 2^— 1, one recovers (as expec ted) the standard Curie- Weiss mean- field equation, re-derived by Brock 
and Durlauf [2l| in the present framework: (l42j 



m* EE 20* - 1 = tanh 



|(Jm*-hF) 



(13) 



The solutions of that equation are well known. When F = 0, there is a critical value /3c = 2/ J separating a high 
noise regime /3 < Pc where agents shift randomly between the two choices, with (/>* = 1/2. As first noted by Weidlich 
p^ . a spontaneous "polarization" of the population occurs in the low noise regime /3 > /3c, i.e. (p* 7^ 1/2 even in the 
absence of any individually preferred choice (i.e. F — 0). When F 0, one of the two equilibria is exponentially 
more probable than the other, and in principle the population should be locked in the most likely one: (fi* > 1/2 
whenever F > and 4>* < 1/2 whenever F < 0. In the limit /3 — > 00, the above equilibrium analysis suggests that as 
soon as F > (i.e. when the adoption cost C is less than J), the whole population should move towards the "right" 
equilibrium (/)*«!. 

Unfortunately, the equilibrium analysis is not sufficient to draw such an optimistic conclusion. A more detailed 
analysis of the dynamics is needed, which reveals that the time needed to reach equilibrium is exponentially large 
in the number of agents, and as noted by Keynes, in the long run, we are all dead. This situation is well-known to 
physicists, but is perhaps not so well appreciated in other circles - for example, it is not discussed in [2l| . This is the 
main message of the present section, which would otherwise not be much more than a textbook exercise. 



B. Potential barrier and metastability 

Define a potential function V{(j)) such that = —dV/d4>. Up to an irrelevant additive constant, one finds 

V(0) = f -^lu(l + e^™^^)). (14) 

The analysis of V{(f>) is easy in the limit /3 ^ 1. Since </> is G [0, 1], one finds that as long as |F| < J the potential 
has two minima situated, as expected, very close to = and cf) ~ 1. The difference V{4> = 1) — V(0 — 0) is equal to 
—2F meaning, as explained above, that the 0* « 1 equilibrium is indeed favored when F > 0. But there also appears 
an energy barrier between the two minima, situated at 0max = (1 ~ which makes it hard for the system to 

crossover from the "bad" minimum to the "good" one. Using the above Langevin equation, Eq. ([TT|) . one finds that 
the time r needed for the system, starting around = 0, to reach 0* « 1 is given by: 

r cx exp [AN{1 - F/J)] , (15) 

where A is a numerical factor. The important point about this formula is the presence of the factor iV(l — F/J) in 
the exponential. This means that whenever < -F < J, the system should really be in the socially good minimum 
(/)* ~ 1, but the time to reach it is exponentially large in the population size. In other words, it has no chance of 
ever getting there on its own for large populations. Only when F reaches J, i.e. when the adoption cost C becomes 
zero will the population be convinced to shift to the socially optimal equilibrium - simply because </>* w is not a 
minimum anymore (note indeed that (/)max — as F J). 

As /3 decreases towards /3c (i.e. as the noise in the decision process increases) , the two minima (/)j_ get closer to each 
other and the barrier between then gets smaller, until hysteresis and polarization effects disappear altogether. 

Let us summarize what happens in a situation where the cost C of adopting a new technology, a new idea, or a 
socially beneficial attitude, decreases steadily as a function of time in our (so far) homogeneous population. When 
individual make near-optimal choices, the population as a whole will not adopt until C = 0, even if it would be optimal 
for the community as a whole to adopt as soon as C < J - it would actually do so if one was prepared to wait an 
exponentially long time. This can be seen as a mathematical incarnation of the tragedy of the commons. As a function 
of time, one should see an extremely small number of adopters until the cost drops to zero, in which case the whole 
population finally shifts to the new paradigm. This is very different from the standard model of innovation diffusion, 
based on the following simple differential equation proposed by Bass in 1969 ([57.], and for a recent illuminating review 

My- 



(16) 
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which says that the rate of adoption increases with the number of people who have adopted. This equation leads 
to the famous "S-curve" for adoption. It predicts that the adoption speed is maximum when — 1/2, at variance 
with the above scenario where the adoption speed abruptly increases when is still quite small. Note however that 
the discontinuity in time should be smoothed by the speed at which information about the behaviour of others is 
spreading; but when applied to financial markets, for example, this time can indeed be very short. 

What are the solutions to escape from the "bad" equilibrium as soon as it becomes collectively profitable to move 
to the "good" one? Clearly, this is a coordination problem, which becomes hard to solve when TV is large. Apart 
from lowering the cost C, the model above suggests at least three possibilities to make the barrier easier to cross: a) 
increase the variance term E by coordinating the moves (instead of having independent decisions): this can be done 
through advertising campaigns, word of mouth, etc.; b) decrease /3, i.e. increasing the noise, for example when the 
information about the true costs is ambiguous; c) decrease TV, i.e. the number of people in direct interaction - people 
do not adopt because nobody else adopts. In physics, the existence of mutually inaccessible minima with different 
potentials is a pathology of mean-field models that disappears when the interaction is short-ranged. In this case, 
the transition proceeds through "nucleation" , i.e. droplets of the good minimum appear in space and then grow by 
flipping spins at the boundaries. This suggests an interesting policy solution when social pressure resists the adoption 
of a beneficial practice or product: subsidize the cost locally, or make the change compulsory there, so that adoption 
takes place in localized spots from which it will invade the whole population. The very same social pressure that was 
preventing the change will make it happen as soon as it is initiated somewhere. 

Before leaving the homogeneous Ising-Weidlich model of this section, we would like to note that the solar heating 
example is in fact not be the best one, because the dynamics of the model assumes that choices are to some extent 
reversible, i.e. each agent can in principle flip back and forth between the two options. While this is clearly the case 
for opinions, tax-evasion, safe driving, trust, etc., it is harder to imagine that agents can easily switch between solar 
cells and heating oil - at least on short enough time scales to make the above description useful. The model in the 
next section is actually better suited for such irreversible choice situations. 



IV. THE ROLE OF HETEROGENEITIES: SCALING AND AVALANCHES 
A. The zero-temperature rfim and classical models of adoption 

Let us now come back to the idea that agents are not clones and have different preferences or willingness to pay 
/,;S, distributed according to a certain p.d.f p{f)- The case where social pressure is absent, i.e. Jij = is very simple 

to analyse when — )• oo. Call P<{f) = //^ dxp{x) the cumulative distribution of /i, i.e. the probability that fi < f. 
The aggregate demand, or average opinion m for a given incentive field F{t) = F is easily obtained as: 

mo = -P<i-F) + (1 - P<{~F)) = 1 - 2P<(-F). (17) 

(The subscript means that J = here). As F{t) increases slowly from — cx) to -l-oo, the average opinion evolves 
from — 1 to -|-1 in a way that mirrors exactly the distribution of a priori opinions in the population. For a generic 
distribution of fi's (for example, Gaussian), the opinion evolves smoothly as the polarization field is increased - see 
Fig. 1, blue line. If one interprets F as minus the price p of a product, and fi the price at which an agent is happy 
to buy, the total demand curve d — {m + l)/2 is clearly equal to the number of buyers, i.e. 

/•OO 

d{t) = TV(1 - P< {p{t)) = N / dxp{x), (18) 

Jp{t} 

which increases as the price decreases. For a unimodal distribution, d{t) will look like a smooth "S-curve" as the price 
decreases, or as the quality of the product increases. This is called the "moving equilibrium" model in the innovation 
diffusion literature [58| . 

The situation can change drastically when imitation is introduced. We keep here with a mean-field coupling to the 
average opinion, and comment on the robustness of the results later, section HV FI This global feedback effect simply 
shifts F to F -I- Jm, leading to a self-consistent equation: 

m* = 1 - 2P<(-F- Jm*). (19) 

The threshold model of Schelling and Granovetter [l3, [HI is a particular case, although usually expressed slightly 
differently. As above, call the number of agents deciding to attend a seminar, or join a riot, etc. Each agent has 



8 



a preference expressed as a conformity threshold: if the observe d num ber iV+ exceeds a i dependent number that we 
set as A^(l — fi), then agent i joins as well the next time round. [l43l | This leads to a dynamical equation of the form: 

0t+i=/ d/p(/) = l-P<(l-</)t), with 0t = ^ (20) 

The "equilibrium" value is given by the fixed point solution (jj* — 1 — P<(1 — 4>*) which coincides with Eq. (1191) with 
J = —F = 1/2. Eq. (IT^ contains F as an extra degree of freedom, which allows one to treat both conformity and 
the moving equilibrium idea when F is time dependent. Note that when all J^j's are positive (which is the case of 
the mean-field model), decision changes are 'irreversible' when F(t) has a monotonous evolution in time. In other 
words, if Si flipped from —1 to +1 at a certain moment in history, it will never flip back to —1 later. This would not 
be true in the much richer (and more complicated) case where Jij take both positive and negative signs, describing 
contrarian effects. 



B. Imitation vs. heterogeneity 

Coming back to Eq. ((T9)) when imitation is weak enough, one can expand the right hand side in powers of J, leading 
to first order to: 

(21) 



1 - 2p(F)J' 



This equation shows that around the point where the slope of mo vs. F is maximum coincides (for symmetric 
distributions) with the point where the speed of variation of opinion changes is maximally amplified: imitation leads 
to exaggeration. As imitation becomes stronger, the above perturbative expansion breaks down. Still, the maximum 
slope of m* vs. F increases and finally diverges for a critical value J = Jc, beyond which m* is a discontinuous 
function of F (see Fig. 1, red line). Much as in the previous section, the self-consistent equation Eq. (IT9|) has, for a 
range of F, three solutions for m, one of which being unstable (see Fig. 1). In an economic context, it means that 
the demand for a product can be a multi- valued function of price, i.e. the possible coexistence of a high demand and 
a low demand solution for the same price. This has been fully investigated in ^38l - l40| . 

The solution chosen by the system depends on history. Suppose one starts with F — — oo and m = — 1 (zero 
demand), and slowly increase F to make the alternative choice more appealing. The average opinion will first 
follow the lower branch until it jumps discontinuously to the upper branch, for a certain threshold field Fc{J) (and 
symmetrically on the way back, at —Fc{J), as the field is decreased). Although Fig. 1 is qualitatively similar to the 
results of the previous section (which formally corresponds to cr = 0, where a is the width of the distribution p{f)), 
there are many crucial differences. For example, the m* jump amplitude is not equal to 2 but depends continuously 
on J even when /3 — >■ oo, and vanishes when J — ?> J+ as ( J — Jc)^ , where /? = 1/2 in mean-field or on tree-like 
graphs. Another difference is that the dynamics between F — —oo and Fc{J) when the jump occurs is made of 
avalanches of different sizes, which we will discuss in section FlV Dl below. A third difference is the existence of many 
more "equilibrium states", beyond the ones corresponding to the protocol that we describe here (i.e. F = — oo and 
all spins down at i = 0). Other states can be reached, for example preparing the system with m = 0, F = one will 
follow a totally different hysteresis loop as F is inc reased. This "sensitivity to initial conditions" , or strong history 
dependence, makes the model quite appealing. [144] 

What sets the critical value Jc of the interaction parameter beyond which strong herding effects occur? Dimensional 
analysis immediately imply that Jc = Aa where A a numerical constant (perhaps infinite) that depends on the detailed 
shape of p(/), and also on the topology of the graph. This means, as intuitively expected, that jumps and collective 
frenzy happen more easily in homogeneous populations (low cr) than in strongly heterogeneous populations. Stability 
(of markets, democracies,...) requires diversity. For the mean-field model with a Gaussian p, A = \/27r « 2.5. If the 
graph is a regular tree, then A < oo whenever the number of neighbours is > 4 [sOj, but the transition disappears for 
a tree with 2 (i.e. a line) or 3 neighbours. For these low connected lattices, interaction effects are never strong and 
there is no hysteresis phase. On a two-dimensional square lattice with each bond of strength J, the threshold is for 
Jc ~ 1.56(7 [5^ and Jc ~ 0.46(7 for a three-dimensional cubic lattice [sJl (both for a Gaussian p{f)). 



C. Critical scaling 

The vicinity of the critical point J ^ Jc reveals interesting critical properties, to a large extent independent of the 
detailed form of p{f). Noting e = Jc — J the distance from criticality, one finds that the opinion slope S = dm/dF 
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FIG. 1: Average opinion, or aggregate demand, or overall trust (etc.) m, as a function of the external incentive field F(t) in the 
RFIM. For an imitation parameter J < Jc, the curve is smooth. But when J > Jc an hysteresis effect appears with two possible 
stable states for the same value of F, plus one unstable state. As F decreases the overall trust remains high even for negative 
values of F, before suddenly collapsing at -—Fc{J) to the lower branch (as indicated by the arrow). The hysteresis effect means 
that trust can only reappear if F grows back beyond +i^c(J), i.e. much beyond the value where the crash happened. 



takes a scaling form in a relatively wide region around of Fc{J) [SJ]: 

where the function Q{x) can be computed explicitly (4ll. l60j. and is universal. In particular, Q{Q) is a finite constant 
and Q{x — > oo) ^ x^'^/'^. Eq. (|22|) means that the slope, as a function of F, peaks at a maximum of order e^^ and 
remains large on a small window of field of order w ~ e'^/^. In other words, the height h of the peak behaves as an 
anomalous power of the width w~'^ with k — 2/3. This is strikingly different from what is expected for any regular 
model, like the moving equilibrium model (J — 0), for which h ^ 1/a and w ^ cr, leading to k = 1. Interestingly, 
this leads to testable predictions, which only rely on the assumption that the external incentive field is a smooth 
function of time - see section IV Al 



D. Avalanches 



If now one 'zooms' on finer details (on scale 1/A^) of the curve m*{F) in the vicinity of J = Jc, one discovers a 
very interesting structure. One finds that the evolution of m* is actually resolved in a succession of 'avalanches' of 
different sizes s, where the first opinion change induces (through interactions) a second one, then a third one, etc. 
After the sth one has flipped, the system is stable again, in the sense that for the current value of F all S'i's agree 
with the sign of the total incentives Ui. 

The statistics of the avalanches is in fact easy to understand in mean-field, because the dynamics maps into the well 
known critical branching process. When one agent flips from Si — —1 to Si = +1, it induces a change of incentive 
AU = 2J/N for all other agents. For a given value of F and m*{F), the agents that are on the verge of flipping have 
a preference fi precisely equal to — — Jm*. The average number of flips induced by the flrst one is thus given by 

2 7 

n = Np{-F - Jm*) X —. (23) 

Each of the newly converted agent can convert on average n new ones, etc. In mean-field, this is equivalent to the 
Galton- Watson branching process with branching ratio n [6l|. It is well known that when n < 1, avalanches are all 

of finite size, whereas when n > 1 some avalanches will diverge and sweep the whole system. In the region of ti ~' 1, 
the distribution of avalanche sizes is a truncated power-law: 



(24) 
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Now, let us explain why n = 1 corresponds to the critical point J = Jc, F = Fc. Eq. ^T9\\ can be written m* = G{m*), 
with G{m) — 1~ 2P^{—F — Jm). One thus sees that the function G{m) must cut the first diagonal once for J < 
and three times for J > Jc- The point J = Jc must correspond to the case where G{m) is tangent to the first diagonal 
at the point of crossing. Therefore, at the critical point, 

G'{m*) = 2Jp{-F - Jm*) = 1, (25) 

which is indeed equivalent to n = 1 (see Eq. ((23)) . Interestingly, exactly the same scenario occurs in the so-called 
Fiber-Bundle model for fracture [S^jE^I, which is very close in spirit to the above models. In a nut-shell, the Fiber- 
Bundle model assumes initially N intact fibers in parallel, subject to an external force NF that is equally distributed 
over all fibers. Fibers are heterogeneous, and when the force acting on the ith one exceeds a certain threshold 6i, the 
fiber breaks and the external force is redistributed over all remaining fibers. It is clear that the fraction of intact 
fibers obeys the following self consistent equation: 

where p{9) is the distribution of fiber strengths. For F < Fc, the above equation has generically two solutions that 
merge for F = Fc and disappear for F > Fc (i.e. the bundle breaks apart). Again, when F = Fc the function P>{^) 
has a slop e equal to one at </>*. The same analysis as above can be carried over for the avalanches in this model as 
weU. [MSj 

This analysis offers an enticing microscopic picture of how large opinion swings actually develop in a population: 
as a branching process that generates avalanches. Most of them are small, but some may involve an extremely large 
number of individuals, without any particularly large change of external conditions (i.e. change in F{t)). The power- 
law distribution that appears is tantalizing and might enable one to understand why burst of activities and power-laws 
appear in the distribution of returns in financial markets, for example. But there is at present no convincing argument 
that would make this analogy sharp. 



E. Delays and trends 



The above analysis assumes that the external incentive field F{t) varies extremely slowly, so that the state of the 
system can follow adiabatically the equilibrium condition Eq. (fTOl) . It may be useful to generalize the model to include 
lag effects, and to write the following time evolution equation |58|: 

^^p[l-2P^{-F{t)-Jm*)-m*], (27) 

where defines an equilibration time scale, i.e. the time needed for agents to adapt to the new piece of information 
that changed F{t). If /i^^ is very short compared to the evolution time of F{t), we are back to the previous analysis. 
Otherwise, lag effects matter and could for example affect the avalanche size distribution P{s) for large s. 

Another potentially interesting generalisation is to include the influence of past trends. People are not only sensitive 
to the contemporaneous behaviour of others, they also take cues from past patterns. An obvious candidate is the 
speed at which m* has varied in the past, which may give a sense of urgency and adds to the external incentive F{t). 
The simplest model that encodes this would be Eq. (|27)) with F{t) replaced by: 

F(O^F(0+g^'dt'e"r(*-*')^, (28) 

where F"-'^ is the time scale over which the past trend is computed and g the strength of the coupling to past trends. 
This could be an interesting model for the dynamics of financial markets, similar to the one discussed in section fVlI CI 
Other examples of feedback between past history and present decisions will also be reviewed there. 



F. Phase diagram for noisy decisions 

Before moving to more quantitative evidence for the type of effects that the rfim is supposed to capture, it is 
important to end the present section with a discussion of the role of the coefficient (3 = 1/T in the decision rules. Up 
to now, we considered that /3 = oo - i.e. decisions are 'rational' and conform to the overall incentive. Si = sign[[/i]. 



11 



Jumps (SG) 




Jumps (F) 



T=l/p 

FIG. 2: Phase diagram, in the plane (T = 1//3, a), of the mean-field rfim. The model in section UlI Al corresponds to the x-axis 
((T = 0), whereas T = rfim of the present section corresponds to the y-axis. The aggregate behaviour is smooth above the 
red boundary, but exhibits jumps and hysteresis below. An even more complex Spin-Glass-like phase, with an exponentially 
large number of metastable states, appears to the left of the dotted green line (see [64|] for more details) . 



Introducing some amount of noise does not immediately destroy the above picture: for a given imitation strength J, 
there is still a critical value crc{T) of the heterogeneity below which jumps occur and above which the demand curve 
becomes smooth. Similarly, for a fixed ct, jumps remain until decision noise becomes larger than a d-dependent value 
Tc((t) (i.e. for /3 smaller than /3c = '^/Tc). The situation is summarized in Fig. 2, which shows the region of the 
(T, cr) plane where jumps and hysteresis occur, located below the red line. Above that line (i.e. where heterogeneities 
and/or noise are strong enough, there is a single equilibrium and a unique demand/opinion curve). The model in 
section UlI Al corresponds to the x-axis of this graph (tr = 0), whereas the results of the present section correspond 
to the y-axis (T = 0). From the point of view of potential applications of the model, it is reassuring to see that the 
whole scenario predicted by the rfim, and in particular the transition between a "crisis" phase and a smooth pha se, is 
quite robust against details (noise, network topology, shape of the probability distribution of preferences, etc.). |l46l | 
Finally, note that there is a second, green line in Fig. 2 |64| . separating a regime of strong history dependence where 
the number of equilibrium states is exponential in the number of agents and hysteresis sub-loops are stable, from a 
phase that is more similar to what happens in the homogeneous case a = 0, with a single (large) hysteresis loop. 



G. Summary 



In summary, the transposition of the rfim to opinion/demand formation predicts that the average opinion evolves, 
as a function of the global solicitation, very differently if imitation effects are weak (in which case the evolution 
is smooth) or if imitation exceeds a certain threshold, in which case aggregate quantities are discontinuous and 
catastrophic avalanches may be triggered. More quantitatively, the model predicts that in generic situations, the 
speed of change should peak more and more as the critical point is approached. As a function of time, the speed of 
change exhibits a maximum of height h and of width w, the two being related hy h ^ This is very different 

from non-critical adoption models, such as the "moving equilibrium" model, for which h w~^. These predictions 
seem to be qualitatively and even quantitatively relevant in several instances that we describe in the next section. 



V. THE RFIM SCENARIO: EMPIRICAL EVIDENCE AND POSSIBLE APPLICATIONS 



It is now a rather well established fact that people do not make decision in isolation but rely on the choice/opinion 
of others, see [65l [66j for a detailed discussion and many references. Recent work include in particular the artificial 
culture market of Salganik et al. [67| , the analysis of earning forecasts by expert analysts 68] , or the wisdom of crowds 
[69| . Of course, anecdotal evidence concerning famous financial bubbles (like the tulip- mania in Amsterdam and other 
examples of 'madness of crowds' flBl) or famous fads & fashions (like Louis XIV's wig, Catherine de Medicis' forks 
and Richelieu's rounded knives, etc.) make it quite obvious that social pressure can be disturbingly dominant. Keynes 
conceived imitation (herding) as a response to uncertainty and to the recognition of one's own ignorance: people follow 
the crowd because they think (or fear) that the rest of the crowd might be better informed. Gigerenzer provides a 
complementary interpretation [7l|, [72|: people tend to follow rules of thumb and routines. Imitation qualifies well as a 
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FIG. 3: Left: Height of the peak /i, vs. width of the peak w, in a log-log scale, both for birth rates and for cell phones, for 
different European countries (see [JJ]). A best fit leads to a slope (in log scale) of, respectively: —0.71 ± 0.11 and —0.62 ± 0.07. 
The mean field rfim critical prediction h ~ is shown for comparison. Right: Height of the peak h, vs. width of the peak 

w, in a log-log scale, for 17 applause endings. A best fit leads to a slope of —0.64 ± 0.07, again very close to the mean-field 
prediction of —2/3. We also show for comparison the —1 slope corresponding to a 'trivial' behaviour, h ^ 1/w (thin dotted 
line). 

'fast and frugal' heuristic, which furthermore has minimal information costs. However, a recurrent problem has been 
to disentangle effects induced by direct imitations or social pressure from correlations that would appear just because 
people have the same information or the same cultural bias, etc. 

A. Some quantitive evidence 

In that respect, the robust prediction of the rfim alluded to in the previous section allows one to test for the 
existence of herding effects with minimal extra assumptions. This is the program that we followed with Q. Michard 
in (4l| . The idea was to compare the dynamics of the very same phenomenon (for example, the adoption of cell 
phones) in different countries, supposedly characterized by different imitation parameters, and different distributions 
of willingness to adopt. Still, assuming (a) that the speed of evolution of the external incenti ve fie ld F{t) is similar 
in all countries, and (b) that the ratio J/a is not too far from the critical value of the model |l47| then the scaling 
prediction Eq. should hold, leading to a universal relation between the height h and the width w of the evolution 
speed S = dm/dF = v~^dm/dt, where v = dF/dt is the speed of evolution of the external incentive field, assumed to 
be independent of the country or population. If the distance from the critical point varies across the set of available 
countries or crowds, S s houl d reveal peaks with different heights and widths, but all related by ft, ~ w~'^ with k = 2/3 
if imitation is strong. [148| We gath ered data concerning (a) the drop of birth rates in European countries in the 
second half of the XXth century [T^ (b) the increase of cell phones in Europe in the 90s, (c) the way clapping dies 
out at the end of music concerts and (d) crime statistics in different US states in the period 1960-2000. Although 
social influence have been argued to be important in this case too [7^] , our data set did not show enough idiosyncratic 
variations across states to be exploited. In the first three cases, we find that our data fits well the picture suggested by 
the model, and that collective effects seem to be present, with indeed k « 2/3 - see Fig. 3. The example of clapping 
is interesting because it is very close to being a controlled experiment [4lj. In that case, we observed both continuous 
and abrupt endings, as predicted by the model. Challet et al. (TSj analyzed data from a Swiss on-line supermarket, 
and report k = 0.55 ± 0.16 for soap consumption. More data would be needed to ascertain the relevance of the rfim 
scenario in real situations. In particular a finer time resolution would allow one to test one of the crucial prediction 
of the model: the presence of bursts, or avalanches, with a tru ncated power-law distribution of sizes (see Eq. (p4)) ). 
This would be a clear-cut signature of imitation cascades. |l49j | 

B. Metaphors 

Still, the model has a huge narrative potential, and is well adapted to describe, qualitatively or quantitatively, 
many situations. For example, Nadal et al. [38-4(| have analyzed the model in great detail in the case of a product 
sold at price p by a monopolist supplier, and in the presence of imitation effects on the demand side. They note that 
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the monopolist has two different strategies: either to attract a large fraction of buyers at low prices or to target few 
buyers willing to pay high prices. There is again a first order transition where the optimal strategy switches abruptly 
from one to the other [40|. They further note that, interestingly, the optimal price is generically just slightly below 
the price at which the high demand equilibrium disappears. A small change in the customers characteristics may lead 
to a decrease of this critical price. If this change is not anticipated by the seller, the posted price may become greater 
than the critical price, resulting in a collapse of the demand, and relate this observation to Becker's interpretation of 
the "crowded restaurant paradox" [2^ . 

The sudden shift from the high demand branch to the low demand branch is also a natural mechanism for crashes in 
financial markets. I personally believe Q, like many others, in the critical importance of trust as a determining factor 
for the prosperity of economies, and for securing well-functioning financial markets. Trust is a collective asset that 
allows efficient coordination and tremendously accelerates business. The most illuminating example is fiat money, 
that can only exist if people trust that a piece of paper will not be worthless tomorrow - and of course if everybody 
gives value to a dollar bill, the dollar bill is valuable. Clearly, money is more efficient than barter because it solves the 
otherwise daunting problem of matching individual preferences. On the other hand, because trust is an immaterial 
sentiment, it can evaporate overnight. Changing one's mind can be instantaneous, the speed of change is not limited 
by physical constraints. This is one of the reason why crisis, crashes and bank runs can occur (this point of view is 
forcefully expressed in, e.g. [10, HH and refs. therein). Think for example of the sudden freezing of the money markets 
just after Lehman's default: clearly, banks stopped lending because all other banks stopped lending. Credit means: 
"I believe (credo) that I will see my money back some day", and this belief can disappear even if no destruction of 
wealth has physically taken place. Established trust is a situation where the subjective probability of default or of 
failure is very small. 

Several quantitative models for trust collapse have been studied in the past few years, see e.g j43 - |46l . [8l| and refs. 
therein. Some are directly related to the rfim, which can indeed be formulated as a model for trust cycles, associated 
to bubbles and busts, bank runs and hyper-infiation. Suppose that for some reason the world is in a euphoric state, 
corresponding to — > +oo and m — 1 - everybody trusts the system. Fiat money is considered as absolutely safe. 
Now, imagine that for some reason the objective trustworthiness of the system F decreases. This could be due to 
a rise of infiation, or to over-valued prices of assets or houses that appear far above any reasonable fundamental 
value. This could also be due to political instability, or an increase in any measured or perceived risk. The most 
risk adverse individuals will act first and sell their assets, or take their money away from the bank, whereas most 
people firmly believe that the system is safe. [150] Still, these initial "irrational" reactions seed wariness in the mind 
of others - what if they were right? Maybe I should do the same? As dark clouds accumulate, it is easy to guess 
what will happen. Either imitation effects are small and people sell one by one their assets, leading to a slow decay 
of the stock markets or a soft landing of the business cycle. Or imitation is beyond the critical threshold and after 
an extended period where confidence is only maintained because everybody else is confident (as in a bubble), a small 
exogenous event (i.e. a small further decrease of F) may be enough to trigger global panic, i.e. crashes, bank runs, 
hyper-inflation, etc. Because the system is hysteretic, the fundamental information F must grow back much beyond 
the point where the crisis occurred for people to regain confidence (see Fig. 1). 

One can think about the model slightly differently. Imagine that the external world, captured by F, is fixed, but 
that the imitation parameter J can vary in time. This is quite realistic: in periods when uncertainty is small, people 
can rely on fundamental information to make a judgement. When uncertainty increases, it becomes more and more 
tempting to take one's cue from the crowd. As J increases past Jc, an initially weakly polarized state m ^ can 
suddenly evolve towards m = ±1 (see Fig. 1). This is a kind of volatility feedback effect, where past volatility may 
trigger more herding effects, finally leading to a crash (see e.g. [i^ [s^ [83j). 



C. Spatial effects 



Another interesting topic is the spatial structure of the incentive field F, which can be interpreted in the RFIM 
setting as the average willingness (to pay /adopt /trust, etc.), around which an idiosyncratic component ft is added. 
It is expected that on several issues, the average willingness strongly depends on regional characteristics and culture. 
One specific example for which spatially resolved data is available are elections. The simplest observable is the turnout 
rate, which reveals persistent patterns, both in time and in space - see [13, [13 and Fig. 4. Calling (f>{r) the turnout 
rate on a given election in a city located at r, one can define the logarithmic turnout rate r(r) as: 

T = ln(-- Te(-oo,+oo) (29) 
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FIG. 4: Heat map of the normalized logarithmic turnout rate r(r) for the 2004 European Parliament election in France, 
Germany, Italy, Poland and Spain. Note the strong, long ranged correlations in the patterns. Inset: Average of spatial 
correlations C(r-) over 20 different French elections, in a semi-logarithmic plot to reveal the quality of the logarithmic behavior, 
Eq. (ESI. See lii] for details. 



The pattern of Fig. 4 suggests a non-tr ivial bahaviour of the spatial correlation of the r field, which is found to decay 
roughly logarithmically with distance: |l5l| 

C(f) = (r(i?)r(i? + r)) - (r(i?))2 « A- Bin r, r = |f|, (30) 

as shown in Fig. 4, inset. This logarithmic dependence holds well for different elections and different countries, up to 
distances r comparable to the size of the country [s^ . Another striking observation is that the voting habits of the 
different regions seem to be extremely persistent in history [83|. 

Assuming a logistic distribution for the idiosyncratic willingness to vote, one finds: jl52j | 



^=^^^r^PF, (31) 

where F is the (local) average willingness to vote. From this identification, we conclude that this average willingness 
itself has logarithmic correlations in space and long-range correlations in time. This suggests the existence of a 
"cultural field" that keeps the memory independently of the presence of particular individuals. 

A simple and rather natural idea is to decompose the time dependent incentive field F{f,t) as a sum of a f- 
independent component F(t) (which measures the average interest in the election), a slowly varying "cultural" field 
ipir, t) that we want to model, and a white noise (in s pace) contribution e(r, t), which models city-specific idiosyncrasies 
and only has short-ranged spatial correlations '8J,|83|- The cultural field ^'i^jt) should evolve due to (a) cultural 
"shocks" that contribute to changing the overall mood in a given city (such as the closing down of a factory or of 
a military base, important changes in population, etc.) and (b) diffusion, as a consequence of the exchange of ideas 
and opinions between nearby cities. Through human interactions, the cultural differences between nearby cities tend 
to narrow, leading to a Laplacian term in the evolution equation (in the continuum limit) . We therefore proposed to 
write the following noisy diffusion equation for i>{f,t) [s^ [8^: 

^-^^=D^^^{r,t)+r^{r,t), (32) 

where D measures the speed of diffusion of the cultural field |l53l | and ri{r, t) a white noise field. If cities were located 
on the nodes of a regular grid, it would be easy to show analytically that the stationary correlation function of the 
field ipifji) decays as a logarithm of distance, as given by Eq. (pO)) . However, the spatial distribution of cities in real 
countries is quite strongly heterogeneous, which leads to significant concavity when C(r) is plotted as a function of 
In r [IHl . Quite remarkably, this concavity is very similar to what is observed for the empirical correlations [U, [s^ , 
which tends to confirm that the long-range spatial correlations observed in election turnout rates can indeed be traced 
to the presence of an underlying cultural field with a diffusive dynamics. 

In fact, we discovered while preparing this review that a diffusion equation very similar to Eq. p2p was also 
proposed by Schweitzer and Hoylst in 2000 [s^l to model opinion formation. They call the "information field" 
and their equation can be written, in the present context, as: 



dt 



Dy'^{r,t) + k[(l){r,t)~-ij{r,t)]. (33) 
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The last term describes a coupling between the past realized turnout rates and the cultural field, i.e. large participation 
rates tend to motivate people to vote more the next time round. However, the presence of such a term in the diffusion 
equation would cut-off the logarithmic dependence of C(r) beyond a distance \J D/k. Since this is not observed 
empirically, 1/fc is probably large (20 years or more) - at least in the context of election participation. 

The idea of a persistent cultural (or information) field that evolves according to a noisy diffusion equation seems 
quite important. If true, many other social phenomena like consumption habits, behavioral biases, etc. should exhibit 
long ranged (logarithmic) spatial correlations. More empirical work on the spatial correlations of these decisions, 
for example based on consumption data, in different situations and in different countries, would be very valuable - 
although the results on election turnout in many countries are already quite encouraging [85|. 



D. Multiple choices 



A generalisation of the rfim setting to multiple choices is interesting for obvious reasons: when in a bookstore, 
supermarket or record shop, the choice is not between two products but between thousands, or even millions of 
products. The same is true of the stock market - one can choose to invest in thousands of different stocks. Again, the 
choice of others is an important piece of information that we take into account before buying a book or deciding on a 
movie. It is plausible that herding effects play a role in the appearance of Pareto-tails in the measure of success (book 
sales, movie attendance, smart phones, etc.). A natural generalization of Eq. ^ to an M-choice situation reads (88| : 

<(t + l) = e(^^"+/f + JM7r„(t)), (34) 

where nf is the consumption of product a = 1, . . . , Af by agent i = 1, . . . , A^, Q{u > 0) = 1 and <d{u < 0) = 0, and 
tTq, is the relative consumption of product a, i.e. 



0- 



(35) 



The field -F" describes the intrinsic attractivity of product a, while /f is the willingness to pay of agent i, to acquire 
product a. As above, the J coefficient measures the strength of social influence, and a factor M is added to make 
sure that social influence remains of order unity when A/ — > oo. 

The most interesting question about such a model is to know whether the realized consumption is faithful, i.e. 
whether or not the actual choice of the different items reflects the true preferences of individual agents, as would 
be the case in the absence of interactions (J = 0). Based on the phenomenology of the rfim, one expects that this 
will not be the case when J is sufficiently large. The exact solution of the model in the limit M — > oo indeed shows 
that a phase transition takes place for J ~ Jc, where can be exactly computed in terms of the distribution of 
the Fs and the /s, assumed to be IID random variables. For J > Jc, strong distortions occur, meaning that the 
realized consumption will (i) "condense" into a small subset of all possible choices, (ii) violate the natural ordering 
of individual preferences and (iii) become history dependent: a particular initial condition determines the winners in 
an irreproducible and unpredictable way [s^. Instead of all products getting a ~ 1/M share of the market, some 
(but not necessarily the best ones, with large Fs) become 'hits' and attract a disproportionate number of buyers. 
This is a possible mechanism for the appearance of Pareto tails in most measures of success [9^ [9l| . Similar effects 
were observed in the artificial cultural market of Salganik et al. (67| . which allowed them to conclude that increasing 
the strength of social influence increases both the inequality of market shares and the unpredictability of success (i.e. 
which products become winners). 

The model can be transposed to other interesting situations, for example that of industrial production, for which 
one expects a transition between an archaic economy dominated by very few products and a fully diversified economy, 
as the dispersion of individual needs becomes larger as time passes. 



VI. CHOICE THEORY AND DECISION RULES 

A. The logit rule: arguments and extensions 

The literature on choice theory is extremely vast and has been beautifully reviewed in j49i | (although the field has 
progressed since 1992). We want here to restrict to a few comments that may help to understand some of the problems 
at stake. In a nutshell, classical choice theory assumes that the probability Pa to choose alternative a in a set A of 
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different possibilities can be written as: 



(36) 



where Ua is the "utihty" of alternative a and /3 the "noise" parameter that allows one to interpolate between 
equiprobable choices (/3 = 0) and utility maximisation (/3 — oo). Before discussing the arguments that have been 
put forth to justify the above "logit rule" , one should specify what one really means by "the probability Pa to choose 
alternative a". Two quite different interpretations can be found in the literature (see [49], pp. 30-33): 

• (1) One is that each agent, as a function of time, flips between different alternatives. At each time step the 
probability to choose a € ^ is the above Pa- This could be due to an imperfect knowledge of the utilities 

or to "irrational" effects such as illusions and intuitions, cognitive limitations, etc. It could also be due to a 
truly time evolving context or environment, not described in the theory, but that randomly affects the relative 
utilities of the alternatives, in such a way that the perceived utility of agent i at time t is = Ua + ^^(Oj where 
e^(t) are time dependent random variables. In this interpretation. Pa makes sense even for a single agent: it is 
the probability that, at a given instant of time he or she chooses a. 

• (2) The second interpretation assumes that any given agent i always makes the same choice, based on the 
maximisation of perceived utility [/^ = [/q, + e^, where ej^ are again random variables but now fixed in time. In 
this case. Pa is the probability that a given agent in the population has forever chosen a. 

At the aggregate level, the two interpretations may look indistinguishable (see [12]), and this is indeed true for 
non-interacting (independent) agents. In the presence of interactions, however, the two formulations lead to different 
results for the aggregate behaviour (see section fVI Bp . The first interpretation corresponds to the x-axis in the phase 
diagram of Fig 2, while the second corresponds to the y-axis. 

The proposed justifications for the exponential form of the probabilities in Eq. (1551) are of three types [49|: (a) 
axiomatic [43]; (b) based on specific assumptions about the random variables that affect the perceived utilities; (c) 
based on entropy arguments. On point (b): if the e's are IID random variables with a Gumbel (double-exponential) 
distribution, then it is possible to show that the probability is indeed given by Eq. [92]. However, the deep reason 
for choosing a Gumbel distribution has remained elusive - the fact that the Gumbel distribution is max-stable does 
not seem to help (see however [28j . pp 32-33). On point (c): one can argue that agents do not want to maximize the 
expected utility alone, but take into account the information cost (or the entropy) associated to the weights Pa- As 
is well known, the maximisation of P^a Pa^a — X]q Pa Ih-Pq indeed directly yields Eq. ([55)) . This has been called 
the variety seeking behaviour of agents in [4^ (pp. 7 8-79 ), and the exploration-exploitation compromise in (see 
also (9^). Still, all these arguments look a bit weak; |l54| the ultimate justification seems to be that the logit rule is 
mathematically very convenient [49], and allows one to use well known results in statistical physics because the logit 
rule is equivalent to the Boltzmann-Gibbs measure. 

We will explore more the dynamical interpretation (1) above, which seems to us more realistic. People can change 
their minds and not make the same decision all the time, even when confronted to the very same information (see 
the discussion in [1^). The idea of introducing a "tremor" noise in the utility dates back to Thurston [95*1. If there 
is time correlations in the noise, and no learning or feedback from the past that induce some systematic evolution of 
the utilities, one can think of a dynamical model of decisions as the following Markov chain. At each time step, agent 
i, having made choice a, reviews his alternatives with probability /i and does not bother with probability 1 — fi. The 
possible alternatives 7^0; are in a set Aa C A. Often people do not even consider all alternatives simultaneously but 
sequentially, so just pick one possible 7, and decide whether it is worth going for that one. (We make this assumption 
to simplify the following discussion.) It is reasonable to postulate that the probability for this alternative 7 to be 
adopted is equal to the probability that the noisy utility contributions are such that: 



Alternative 7 is therefore adopted with probability Qy{Ua — C/^), where (5> is the complementary cumulative distri- 
bution function of the noise differences. The time-dependent probabilities Pait) therefore obey the following Master 
equation: 



e'a{t) >Ua-U. 



(37) 





(38) 
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As is well known, a sufficient condition for havi ng; Eq. (p6)) as the equilibrium (long time) solution of this equation is 
the so-called "detailed balance" equation: |l55j | 



In 



.{U-,-U^) 



^{U^-U^), Va,7, 



(39) 



and provided that all choices are reversible, i.e. a £ A-y 'i^ 'f £ Aa, Va,7. Clearly, condition ([5^ is satisfied when 
Q>{u) corresponds to the logistic distribution: 



Q>{u) 



1 



(40) 



but other distributions may be suitable as well. However, there is no a priori justification for imposing that the 
distribution of noise differences should have such a property. If the e's are IID (or at least exchangeable), one can 
deduce in full generality that there is a function R such that: 



R{u) = -i?(-u), R'iu) > 0, 



R{u — > oo) = —. 



(41) 



So expanding for small m's one finds that the detailed balance condition is satisfied to first order with /3 — 4_R'(0), 
but may be violated to higher orders in utility differences: 



In 



Q>{u) 



In 



1 + 2R{u) 



1 - 2R{u) 



4R'{0)u + -(i?(^)(0) + 8i?'(0)3)u3 



(42) 



Not much is known about these non-detailed balance models. A natural conjecture is that for this particular class 
of models, for which the transition rates are functions of a utility (or energy) difference between the initial and final 
state, a quasi-equilibrium state is reached with no large scale currents. The robustness of the phase diagram and of 



the phase transitions to non-detailed balance terms (u 



etc.) is of course a crucial issue. 



The case of binary choice situations with no interactions between agents is clearly much simpler, and can be solved 
in full generality as: 



P(5,=±l) = i(l±m), 



2R{U+ -U-), 



(43) 



and the introduction of a mean-field interaction leads to a self-consistent equation the generalizes the usual Curie- Weiss 
equation (Eq. ([T3| above): 



2R{U+ -U- + Jm* 



(44) 



It is clear that for generic functions R{u) this model still has a second order phase transition when (3 J — 2, of the 
type discussed in section UlI Al (where 2R{u) — tanh(/3M/2)). First order transitions can occur for more exotic shapes 
oiR{u). 

But this point of view shows that there is no reason whatsoever to think that the noise parameter /? should be the 
same for all agents. Some may be more "rational" than others, meaning that a distribution of /3's could be needed - 
something that is unusual in physics. The interaction parameter could also be heterogeneous. The criterion for the 
appearance of hysteresis effects is, in the general case, (/3 J) — 2, where the average is taken over the population. 



B. Non interacting vs. interacting agents, and coordination failure 

It is worth dwelling a little more into the issue of detailed-balance decision rules in the presence of interactions. Call 
ai the choice made by agent i and P{ai, a2, ■ ■ ■ , ctN] t) the joint distribution of the choices of the A'^-agents at time 
t, and C — {ai,a2, ■ ■ ■ jCtAr} the configuration of choices made by all agents. The time evolution of the population 
is a random walk in the space of choices, and is determined by the transition probabilities W{C — >■ C) to go from 
configuration C to configuration C between t and t + 1. Assume that at each time step, only one agent can change 
his choice. The set of accessible configurations from C is made of all configurations that differ from C by a single 
agent changing his choice. If every single agent i uses a logit-rule to update his choice from Ui to 7i, with possibly i 
dependent utilities C/^, one finds for the transition probabilities: 
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Assuming individual choices are reversible in the above sense, do these transition probabilities W{C — )■ C) obey 
detailed-balance? Superficially it seems to be so, but the presence of interactions can ruin the detailed-balance 
property. Detailed-balance is obeyed provided there exists a certain function 'H{C) that depends on the choice made 
by all agents, such that: 

C^7, - C^a, = ^("ii"2, • ■ • ,7ii • • ■ ^aw) - ^(ai,a2, • • • ,ai, . . . ,aAr), Vi, Va,7. (46) 
In this case, one indeed has: 

In [^||^] mn n{C)] , (47) 

such that the long-time equilibrium probabilities are given by the Boltzmann-Gibbs weight: 

7'(C,i^oo) = Z-ie'^^»'^». (48) 

When agents are non interacting, in the sense that the utilities are independent of the choice made by others, then 
clearly: 

n{C) = J2ui,^. (49) 

i 

When there are interactions, this is no longer true, and in fact it is in general not possible to construct such a function 
(the consequences of this impossibility will be discussed at the end of this section). A simple case where such a 
construction is possible, for example, is the binary choice situation where ai — Si — ±1, and where the utilities take 
the following form: 

U'h^^~^-^S, + Y.X,^S,S,, (50) 

with Jij — Jji an arbitrary symmetric matrix. In this case, one can check by inspection that H is given by: 

n{Se}) = -J2^S, + lY.J^jS.S,. (51) 

i i^j 

One sees on this example that H is not the sum of individual utilities. In fact: 

i i^j 

The difference between these two quantities can have important consequences, as pointed out in the beautiful paper of 
Grauwin et al. . They consider a variant of the Schelling model of segregation, where each agent i is sensitive to the 
density g G [0, 1] of the neighbourhood in which he/she is living. Agents do not like living is derelict neighbourhoods 
(g ~ 0) nor in overcrowded neighbourhoods {g ~ 1). Their incentive to live in a neighbourhood situated around r is 
a certain function ?j[£i(r)] that has a maximum at g — 1/2; for example v{u < 1/2) = 2u and v{u > 1/2) — 2 — 2u. 
The global utility is a functional of the density field g{'r) and is given by: 



t/G ({e(r)}) = / drg{r)v[gir)l (53) 

where L is the linear size of the city. Suppose that the size N of the population is such that N/L'^ — 1/2. It is in 
principle possible in this case to make everybody happy and reach the absolute maximum of the global utility of the 
population by having all neighbourhoods ai g — 1/2, in which case Uq — iVwmax, with Umax = — 1/2) — 1. 

Now, what will really happen if agent choose their neighbourhood based on their own utility? More precisely, let 
us assume that each agent can move from r to with a probability given by the logit-rule, i.e.: 

W{r^r') = - , f-,, , (54) 

The function % alluded to above is also a functional of g{r), and according to the general principle expressed by Eq. 
it should be such that, for all fields g{r): 

vW)]~v[g{r)]- (55) 



5g{r') 5g{r) 



19 



where we have assumed that the elementary area of a neighbourhood is normahzed to one. The solution is: 

dg v[q] + AQ[r\ 



( df 


[L 







(56) 



where A is an arbitrary constant, which can always be taken to zero since the total number of agents is assumed to 
be conserved. Therefore, unless u is a constant, independent of g (i.e. for non i ntera cting agents), % 7^ Ug- 
Now, the equilibrium probability is given by the Boltzmann-Gibbs measure: [l56| 



P[{£i(r)}] oc exp 



/3 / dT{V{g[r\)-\g[7^] 



(57) 



with V' = V and A is a Lagrange parameter ensuring that the average density is fixed to a certain p. The most likely 
density configuration, for /3 ^ 00, is such that V' = X, Vr. For the above symmetric piecewise linear choice for w(u), 
one finds that the local density must equal to g = 1/2 — 6 with probability p, and g — 1/2 + 5 with probability 1 — p, 
with p such that g=l/2 + S{2p - 1), and 6 G [-1/2, 1/2] such that: 



m =P[l-^) + (1 - + 2<5 - (^ + S)A =\ + {g- l){6 1) 



(58) 



is maximized. In the case where g > i, the maximum is realized for 6 = 1/2, and when p < i, it is realized for 
6 = —1/2, but both solutions correspond to complete segregation, i.e. completely full {g — 1) or completely empty 
{g ~ 0) neighbourhoods! (The case where 'g — 1/2 is degenerate, but any infinitesimally small departure from this case 
leads to segregation). Therefore, letting people optimize their own utility, to the detriment of the global well-being, 
leads to a disastrous situation. This is a stunning illustration of a case where Adam Smith's invisible hand totally 
fails to bring the societ y to a global optimum. 

Now, as proposed in [96[ , coordination can be improved by making individual conscious of the common good when 
they make their choice, by adding to the change of individual utility a term proportional to the extra cost in the 
global utility, i.e. change the logit-rule according to: 



Av = v[g{r')] - v[g{r)] — > Av + ( [AUg - Aw] 



(59) 



Since Ug is already a functional of the density field, it is immediate to see that the function 'H(;[{p(r)}] adapted to 
the new choice update rule is given by: 



He = {1-C)H + CUg- 
Correspondingly, the equilibrium probability is now replaced by: 



Pdioi"^}] exp 



/3 / dr{{l - Ov{gM) + CgMuig[r\) - Xg[r\} 



L2 



(60) 



(61) 



The most likely configuration can again be computed. One finds that it becomes the socially optimal solution 
g[r\ = 1/2 and Uq/N = Umax as soon as C > Cc = 1/3. As stressed by Grauwin et al., this model shows that the 
optimization strategy based on purely individual dynamics fails, illustrating the unexpected links between micromotives 
and macrobehavior. Making individuals aware of the costs of their choice for the society is, in some cases, necessary 
to nudge the system into cooperation/coordination [9^. 

However, the decision rules that obey detailed-balance are, in the general case of interacting a gents, th e exception 
rather than the rule. Very few general results are known when detailed-balance is violated, see e.g. [98l - llOCll |. Of course, 
some equilibrium will be reached at long times, but there is no explicit construction of the equilibrium probability, 
similar to Eq. (|48l) above. One knows that in the general case, equilibrium currents are present. This is due to the 
fact that the probability to cycle 'clockwise' around a closed loop in decision space is different from the probability 
to cycle 'counter-clockwise' - except when detailed-balance is satisfied. Studying and interpreting these currents in 
a socio-economic context would be very interesti ng, i n pa rticular because these currents are known to considerably 
speed up the equilibration time in certain cases jlOOl Il0l| , preventing the system to get "trapped" in deep maxima 
of the function H (the equivalent of deep energy wells in physics). For example, a natural extension of the model 
of Grauwin et al. that would exhibit equilibrium currents and (perhaps ?) mitigate segregation, would be quite 
appealing. 
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VII. IMITATION OF THE PAST AND MORE INSTABILITIES 
A. Memory &: Habit formation 

Any realistic model of decision should in fact take into account that the utility/incentive of a given choice cannot 
be thought of as time independent. For example, it is hard to know how useful/interesting choice a is without having 
tried it at least once (think about restaurants, for example). A simple way to model this would be to to assume that 
the perceived utility of choice a, U^^, is initially blurred by some estimation error that decays to zero as the total 
time spent choosing a grows; for example: 

4W = eL(i = 0)6-^'^°*, (62) 

where ifa is the fraction of the time during which a was chosen. In effect, this should be similar to having an 
effective parameter (3{t) that increases over time. This is the equivalent, in physics, of an annealing process where the 
temperature T — 1//3 slowly decreases with time. 

As the relative utility of choices becomes better known to agents as time evolves, a second, more important effect 
might kick in: habit formation or "stickiness" . Often, past choices become more valuable only because they happened 
to b e chosen. This can be for good reasons, like creating a loyalty relationship (such at the Marseilles fish market 
[102| ). or because going to the same doctor makes that doctor know your personal problems better, etc. But it can 
also be because of high risk aversion (maybe another choice is better, but maybe it is much worse, so better stick to 
what I have) or sheer intellectual laziness. Hotel and restaurant chains are based on this strong universal principle: 
people often tend to prefer things they know. 

This can be formalized as follows: define an indicator function 9a{t'), which is equal to 1 if choice a is made at 
time t' , and zero otherwise. The utility of choice a at time t for a given agent is written as: 

C/„(i) = [/„(0) + ^"(^ - *')^"(^'), (63) 
t'<t 

where Ua{Q) can be thought of as the 'true' utility of choice a and Ka{T) is a certain memory kernel, which measures 
how much of the far away past is affecting the utility today. When > 0, choices are self-reinforcing. This model, 
with an exponentially decaying memory Ka{T) = xq'^ , was introduced in [OSj to model loyalty formation between 
buyers and sellers in the Marseilles fish market. Suppose initially Ua(0) = 0, i.e. all choices are a priori equally good. 
What happens at long times? Is loyalty a long-term emergent property due to 'imitation of the past', similar to the 
appearance of 'winners' in the artificial cultural market of Salganik et al. due to imitation of past choices of others 
([67| and section IV D I above) ? 

The analysis proposed in [o^ is based on a mean-field argument. If the utilities Ua{t) do not change too fast, 
equilibrium can be assumed to hold at each time step, leading to: 

P„(i)«^(t)-^e^^°(*\ (64) 

where Z{t) is the time dependent normalisation. Now, replacing the history dependent Ua{t) by its average leads to 
a self-consistent equation for Pa{t). Assuming that Pa{t) converges at large times towards P*, these should be given 
by the solution of: 

In P* = - In Z* + ^Pa* (65) 

Clearly, P^ — 1/M, Va — 1,. .. ,M is a always solution, corresponding to the absence of loyalty formation - if a 
denotes the seller in a fish market, it is a situation where buyers tend to change monger every week, and no long term 
relationship arises. However, this fully diversified solution becomes unstable when /3>cq > M(l — q), and the solution 
becomes 'polarized', i.e. a habit, loyalty or addiction spontaneously appears beyond some reinforcing strength >c, 
assuming the memory time (set by q) is fixed. Note that this is an example of a condensation phase transition which 
occurs even with a single agent, since we have not introduced any interaction between agents in the model, only a 
memory feedback. 

This is a very nice effect, but unfortunately the mean-field argument is flawed in this particular case. In order to 
see why and to possibly rescue the result, let us focus to the binary decision case, where a{t) — S{t) = ±1. In this 
case, only the difference U+{t) — U-{t) matters, which can be written as: 



U+{t) - U-{t) = U+{0) - U-{0) + ^ K{t - t')S{t'). 

t'<t 



(66) 
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Assuming U+{0) = U-{0) and using the logit decision rule Eq. ([3]) with /i = 1, one estabhshes that the probabiUty to 
observe a certain history of choices {St}o<t<L is given by: 

_ exp [/3 Eo<t<L Sit) Y.t'<t K{t - t')S{t')^ 

^ - no,.,.2cosh[/3E.<.i^(*-t')^(^')] ■ ^''^ 

But this is similar to the Boltzmann-Gibbs weight for a spin configuration of the one dimensional Ising model, with 
an interaction matrix K{t — t') that depends on the "distance" t — t' between the spins. The difference comes from 
the denominator, absent in the Ising model; but o ne ca n argue that if there is no phase transition in the Ising model, 
there is no transitio n in the a bove model either. |l57( Quite a lot is known about the one dimensional Ising model 
in the limit L — >■ oo jl03l llO^ - In particular, it can be proven that when K{t) decays faster than when r oo, 
the system is always in the high temperature, disordered phase, where the magnetisation is zero. This means that 
the two choices remain equally probable at long times; there is no loyalty formation and ergodicity is not broken. 
Still, it is known that the correlation length of the Ising model (and thus the correlation time in the present version) 
grows exponentially fast with the interaction strength. For example, when the memory decays exponentially as in 
the model of |93|, one find that the typical time during which an agent remains faithful is given by exp[2/3>f(3'/ (1 — q)] 
when jSx ^ 1. The transition found in (93| is thus in fact only a crossover between a fickle phase and a sticky phase. 
However, for longer ranged memory kernel, a true phase transition takes place. Suppose K{t) decays asymptotically 
as T~'^ with 1 < ly < 2, then there is indeed a critical value of /3 beyond which loyalty sets in the sense that the 
probability to choose +1 or —1 is not 1/2, 1/2 anymore. The case marginal = 2 is in fact special and ve ry famous : 
it corresponds to the Thouless- Anderson- Yuval model with a long history in condensed matter physics [l03l Il05j | . 
When v < 1, loyalty appears for any non-zero value of /3. Similar conclusions are expected to hold for multiple choice 
situations as well. Th e cas e where memory adds to interaction between agents was recently considered in the context 
of the Ising model in 

It would be interesting to try to determine empirically whether people's memory of past choices decays exponentially 
or as a power - law. Power-law memory is well documented in many human activities, including financial markets, see 
e.g. [ToTl - fTogj and references therein. We see from the above model that this question might be quite relevant for the 
appearance of persistent choices and decisions, or self-fulfilling prophecies, which are very closely related to the above 
scenario, and to which we devote the next section. 



B. Conventions and self-fulfilling prophecies 

As we mentioned above, habit formation is due to 'imitation of the past' rather than 'imitation of the peers'. In 
fact, both effects can be present and reinforce each other. (l58j | We want to pursue in that direction and consider 
other theoretical scenarios of feedback of the past on current decisions, that may lead to self-fulfilling prophecies and 
endogeno us crashes 0. This of course opens the whole field of learning models, much emphasized in e.g. [20l. [28l. 
llld . Illlj . that is much too big to be fairly covered in this paper. 

In the context of repeated decisions (such as investing in financial markets), a common temptation is to compare 
the present situation with similar situations from the past, and posit that what already happened is more likely to 
happen again. As Brian Arthur puts it: As the situation is replayed regularly, we look for patterns, and we use 
these to construct temporary expectational models or hypotheses to work with 47]. Often some plausible story is given 
to understand why such a pattern should exist. This convinces more participants that the effect is real, and their 
resulting behaviour reinforces (or even creates) the effect: this is a self-fulfilling prophecy. A large consensus among 
economic agents about the correlations between a piece of information and the market reaction can be en ough to 
establish these correlations. Such a 'condensation' of opinions leads to what Keynes called a convention (ll. Ill2| . a 
commonly shared representation of the world on which uncertain agents can rely on to make a decision. However, 
as he further noted, a conventional valuation which is established as the outcome of the mass psychology of a large 
number of ignorant individuals is liable to change violently as the result of a sudden fluctuation of opinion due to 
factors which do not really make much difference.... 

As a striking example of a sudden change of convention, we show in Fig. 5 the correlation between bond markets 
and stock markets as a function of time. This correlation was positive before 1997, because the common lore suggests 
that low long term interest rates should favor stocks. Since bonds move opposite to rates, an increase in bond price 
should trigger an increase in stock prices. But another story has suddenly taken over in the midst of the 1997 Asian 
crisis: a fall in stock markets triggers an increased anxiety of the operators who sell their risky paper and buy non- 
risky government bonds. This so-called Flight to Quality then became the dominant pattern. There are many other 
anecdotal examples of such conventions in financial markets: indicators or news items (infiation, payrolls, etc.) that 
everybody pays attention to at one moment in time, before collectively deciding it is irrelevant after all. 
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FIG. 5: Left: Normalized correlation between the Dow- Jones daily returns and the daily returns of a U.S. bond index with 7 
to 10 years bonds, computed with an exponential moving average of 100 days. Note the convention change occurring at the 
end of 1997. Inset: Zoom on the evolution of the Dow- Jones and the bond index in the last quarter of 1997, during the Asian 
crisis. Right: One characteristic trajectory of the dependent correlation ct in the model defined by Eqs. (|70l72p . for g = 1.2 
and 1 - g = 0.01. 



The sudden shift seen in Fig. 5 is quite remarkable. A model that could explain such a change in convention 
was proposed in [IIS] . Call pt the (log-)price of a certain asset at time and bpt the return between t and t 
Some agents base their strategy for buying or selling on the observation of the temporal change of a certain index Xt , 
which might be a financial index or an economic indicator (for example, dividends, interest rates, infiation, confidence, 
unemployment, etc.), or even the price pt itself or the price of another asset. Denote as Sit the change of this indicator 
between t and t + l, normalized to have zero mean and unit variance. Suppose that it is reasonable that there should 
exist a causal correlation between STt and Spt+i, i.e. that the correlation coefficient between the two time series is: 



't+ij 



It is well known that the best prediction (in a quadratic sense) of Spt+i knowing Sit is given by 



5p[ 



t+i 



cSIt 



(68) 



(69) 



Now, imagine that there are two types of agents, those who act based some information uncorrelated with Xt and 
those who try to take advantage of the possible correlations between Sit and Spt+i- Since the true, fundamental value 
of the correlation c is in fact not known (and might actually be zero!), agents of the second type attempt to learn this 
value from past history. Similar to the above model of [931, let us assume that the memory of the past is decaying 
exponentially, leading to an estimated value of c at time t given by: 



Ct 



1-9 



t-i 

E 



q'-*\SIt'Spt'+i), 



(70) 



where q sets the memory time of the exponential moving average. Equation (j70p is equivalent to the following 
Markovian update of the estimated correlation ct = qct-i -I- (1 — q)SIt-iSpt. 

Agents using the information based on It then decide to buy (or sell) a quantity $i that is an odd function of the 
predicted next return Spt^i oc CtSIt- One expects on general grounds that the demand function $4 is linear for small 
arguments and saturates for large arguments. The small c expansion should thus read: 



^t ^ actSIt - HctSItf + 0{c^), 



(71) 



where a, b are positive coefficients. These orders add to the demand fit of all other agents, who use a different type 
of information. Assuming linear impact of the aggregate order imbalance |l59| . we finally end up with the following 
expression for the realized price change at i + 1 : 



Spt+i = X[nt+ actSIt - bcplf] + O(c^), 



(72) 



where A is the impact coefficient, relating volume imbalance to price changes: Sp = X^. We will define the feedback 
strength parameter as g = aA, which plays a central role in the model, and assume that the 'true' long time average 
correlation c is zero, i.e. there is in fact no information in It- 
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Combining Eqs. (|70l72p and taking a continuous time limit (justified in the limit of long memory kernels, g — >■ 1), 
one finally finds the following Langevin equation for c (see [TTsj for details): 

dc = -^dt + d^, (73) 
ac 

where is a Brownian noise (related to ft) of variance I — q and V is an effective potential given by: 

Vic) = l{l-g)c' + ^b'c\ (74) 

with b' another positive coefRcient |ll3j |. 

The equilibrium distribution of estimated correlations c is then easily found to be given by: 



Z "V 1-9 

For g < 1 the potential has an absolute mini mum at c = 0, whereas for 5 > 1, c = becomes a local maximum and 
two stable minima appear for c = ±c* = ±\/ (5 — !)/&'• Therefore, for g < I (weak feedback), P(c) is unimodal and 
has a maximum at c = 0, reflecting the underlying reality. When feedback is stronger (g > 1), however, the most 
probable values for c are ±c* . This means that for strong feedback, a non-zero fi ctitio us correlation between the price 
and the indicator spontaneously appears - this is the so-called sunspot effect |llCll |. The correlation can be either 
positive or negative, corresponding to the two possible 'conventions'. However, the long time average of correlation is 
still zero even when g > 1, since c randomly flips between ±c*. But the time needed to do so is however exponentially 
large in the memory time of the agents, 1/(1 — q). Hence non-zero estimated correlations can therefore persist for very 
long times, as in the exponential memory model of [93l |. One therefore expects to observe long periods during which 
c sa +c*, until the convention reverts quite abruptly towards c ~ — c*, as observed for the bond-index correlation 
shown in Fig. 5, where we also show a numerical simulation of the model corresponding to g = 1.2 and q — 0.99. 
Finally, suppose now that there indeed exists a small objective correlatio n cn < C 1 between Spt+i and Sit, justified 



by some real economic mechanism relating the two quantities. One can show [113l | that this is accounted for by adding 
a linear contribution to the effective potential V{c): 

Vic) V{c) - coc. (76) 

For g < 1, the most probable value of c is co/(l — g) (= cq for g = 0, as it should in the absence of any feedback). 
Therefore, c is of the same order as its true cause whenever g < I. However, in the limit 5 — )■ 1~, the apparent corre- 
lation becomes much larger than its true cause: the sensitivity of the market to external information is anomalously 
amplified by the impact of traders using that information. For g > 1, the amplitude of the apparent correlation is 
totally unrelated to that of the true correlation, although the sign of the correlation reflects the underlying economic 
reality. One describes here a typic al ex ample of overreaction to news leading to excess correlations that are well doc- 
umented in the literature (see e.g. |ll4l |). For example, the correlations between the stocks bel ongin g to an index and 
the index itself are too strong to be explained by the intrinsic correlati ons b etween the stocks [llSj . These increased 
correlations seem to reflect a self- induced "risk-on/risk-off" mode (see [1165 fo'" similar ideas). 



C. Self-induced bubbles and collapse 



In the previous model, agents were trying to learn correlations to make a forecast of the next price change. But 
among the most pervasive biases in human perception is the detection of trends (or the illusion of trends) . The human 
eye seems to be programmed to make a "fast and frugal" linear regression, and extrapolate the line into the future 
(see [llTj for experimental evidence of this forecasting rule). Humans (and of course, animals) are also very good 
at detecti ng danger. These two basic features can be minimally encapsulated in the following "mood indicator" Ai 

[nlllii: 

Mt^aTt- bUt, (77) 

where 7t is a trend indicator and TZt a risk indicator, and a, b again two positive parameters. The a term describes 
the unrelenting propensity of humans to follow trends, while the b term can be thought of as a risk aversion term, 
prompting investors to sell when the perceived risk becomes too large. For simplicity, one assumes that both 7t 
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and TZt are constructed as above as exponential moving averages of respectively the past price changes and the past 
squared price changes: 

q ^-^ q ^-^ 

t' — — 00 t' — — OQ 

Now, as argued in |ll9j | and more recently in [H^, supply and demand in financial markets cannot be instantaneously 
matched. This is due to the fact that the immediate outstanding liquidity in the order book, or provided by market 
makers, is a tiny fraction (say 1 % or less) of the total daily volume. So in fact there is alwa ys a certain amount of 
pending demand or pending supply, which was called "latent liquidity" in 56]. The idea of |ll9| was to write the 
following dynamical equation for the latent order imbalance $t: 

- $t = +Xt + r2t (79) 

where the first term comes from the fact that a well-functioning market allows to the order imbalance to relax to 
zero on a time scale (through execution and cancellations), whereas the second term describes the arrival of new 
buy orders or sell orders, triggered by the mood indicator Mf The extra random noise fit is coming from exogenous 
events or from the behaviour of agents following different indicators. The price evolution equation is still given by the 
simplest linear model: on average, the latent order imbalance $t biases the price upwards or downwards as 6pt = A$t. 

There are two time scales in the model: (related to the liquidity of the market) and (1 — q)^^, the memory 
time over which trends and risk are computed. The analysis of the model can be made in both limits fi ^ (1 — g) 
and /i ^ (1 — g), with similar conclusions, see (ll9j |. In order to keep the discussion as simple as possible, we focus 
here on the case of extreme short memory, i.e. g — > 0, in which the dynamics of the price is given by: 

Pt+2 - 2pt+i + pt = A($t+i - $t) = -fi{pt+i ~ pt) + a\{pt " pt-i) ~ bX{pt " Pt~i)^ + Afit. (80) 

This evolution is easier to interpret in the continuum time limit. Set v = dp/dt (the price "velocity"); up to small 
terms the above equation then reads: 

1-^ + ^"'. »'(.):^(.-a4 + 4 ,81, 

which describes the overdamped evolution of the position v of a fictitious particle in a cubic potential W{v), in the 
presence of noise. 

When trend following effects are small, or the market liquid enough (Aa <^ /i), the potential W{v) has a mi nimu m 
at u = 0, which translates into a random- walk regime for the price itself (at least on time scales ^ fi~^ , see [ll9| ). 
But W{v) also has an unstable maximum for a negative value v*, beyond which the potential plummets to — oo. 
This describes a risk-aversion induced crash: when several small negative shocks fij accumulate by accident, the risk- 
aversion term —bTZt induces further selling pressure. This unstable feedback loop leads to a crash once the tipping 
point V* is reached. But for the crash to occur, an "energy barrier" W(v*) has to be crossed, a situation that we 
have encountered in the above sections (see sections IlII B I and I VII B [) . The time needed to cross this barrier is, again, 
exponentially large (in l/&^): for small enough risk-aversion, these crashes are very rare events. 

Now, the interesting regime is when trend following effects become strong, i.e. when Aa > /x - meaning that the 
market cannot fully digest the pressure imposed by trend followers. In this case, the potential W{v) has a stable 
minimum for v* = {Xa — ^)/b > around which v(t) oscillates. This means that a trend has appeared: the (log-)price 
is a random walk with drift v* and thus on average increases linearly with time, as pt —po ~ v*t. This is a self-induced 
speculative bubble, which persists because of the trend following tendency of investors, and the inability of the market 
to absorb the demand quickly enough. 

Within the context of the model, there are two possibilities for the bubble to burst. One is a risk-aversion crash, 
as discussed above, where accidental drops of the price, driven by the noise term fit , bring the system close to the 
"cliff", i.e. the maximum of W{v) now situated at ?; = 0, beyond which the crash unfolds. The second possibility is 
the presence of investors believing in a fundamental price level, p* . It is reasonable to model this class of investors as 
contributing to fit in proportion of the difference between the actual price pt and the fundamental price p*. Clearly, 
a large discrepancy is an incentive to bet on a probable reversion of the price towards p* . As the bubble grows, the 
price Pt moves further and further away from the fundamental value: pt — p* ~ v*t. In a first approximation, this 
effect adds to W{v) a time-dependent linear contribution: 

2 3 

Weff{v)^X(bv*tv + {tx-Xa)^ + Xb'^, (82) 
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where </> > measures the strength of fundamental investing. Interestingly, the minimum of this potential (corre- 
sponding to the self-sustained bubble) only exists as long as: 

-**<Tf^^*<*^-^- (83) 

When the price is too high, the self-sustained bubble becomes unstable and a crash also occurs, with v moving to 
large negative values. Note that, as expected, becomes small if fundamental investing becomes dominant (i.e. 
large) . 

Note that Eq. (|83p only gives a rough order of magnitude of the critical time tc beyond which the minimum of the 
potential ceases to exist. The real mathematical problem requires one to deal with the following non-linear stochastic 
differential equation: 

^ = (Aa-^)$-6A('$ " 
dt^ ^ ^' dt \ dt 



-£^(Xa-^,)-^-bX{-^] -X^{p-p*) + \nt. (84) 



However, the simplified treatment in terms of an effective, time dependent potential Wi,ff{v) suggests that the crash is 
associated to the sa me g eneric saddle-nod e bif urcation as the one recently studied in [120] in the context of earthquakes 
and material creep. [l60| The authors of |l2C| suggest that some precursor effects, like a shift of the power-spectrum 
of the fluctuations to lower frequencies, could be used as a signal of the incipient instability. Whether these ideas can 
be transposed to financial instabilities is of course highly conjectural. 

The above model, although highly stylized, seems to capture several interesting facets of the dynamics of financial 
markets: trends feed trends and risk-aversion can lead to crashes. All these effects come about because past events 
feedback onto present decisions. The model also illustrates the competition between trend following strategies and 
fundamental strategies, that leads to bubbles and crashes, and intermittent dyna mics - as i s also the case for several 
agent-bas ed models of financial markets with the same ingredients (see e. g. jl22l . Il23j , and for recent reviews, 



VIII. CONCLUSION 



Fi nancial and economic history is strewn with bubbles and crashes, boo ms and busts, crises and upheavals of all sorts 
|l28l |. Time series of prices exh ibit disco ntinuities on all scales [tI. Il29| . Often, these major events seem unrelated 
to any fundamental news (see [T30l - [l32| V or at least look incommensurate with any reasonable proximate cause. 



Equilibrium models, often expressed in terms of the behaviour of a single rational 'representative agent', are perhaps 
adequate to describe small excursions around a stable situation, but are unfit to explain, or even accommodate, major 
disruptions. The big problem is that these crises are anything but rare [70|]: the economy is not a ship sailing on a 
well-defined trajectory which occa sionally gets knocked off course As stressed by Kirman, the economic crisis is 
a crisis for economic theory [133]. However, having declared the old theory dead and buried is not enough. One is 
faced with a crucial problem: providing a viable alternative. Until such an alternative theory is available and to some 
extent operational, all professional economists (professors, researchers, experts of all kind, policy makers) will prefer 
to stick with the previous paradigm. After all, Ptolemea's epicycles remained for many years better than Newton's 
theory at predicting the motion of planets. 

The aim of this paper was to review recent efforts to include heterogeneities and interactions in models of decision. 
We have argued that the rfim provides a unifying framework to account for many collective socio-economic phenomena 
that lead to endogenous ruptures and crises. Interestingly, the rfim has been proposed as a prototype model to describe 
many physical crises: magnetic avalanches, earthquakes, solar flares, material failures, etc. (35j . In all these examples 
of "complex systems", small causes may lead to large effects. The analogy between these physical phenomena and 
economic/financial turbulence is enticing, at least at a superficial level. In this respect, it is certainly rewarding to 
set up models that can capture potentially destabilising self-referential feedback loops (induced either by herding, i.e. 
reference to the peers, as in sections IlII Al HV Bl or trending, i.e. reference to the past, as in sections I VII Al fVII Bl 
and I VIl"C|) . and that can reproduce some of the phenomenology missing in the standard models. However, the real 
challenge is to move up one notch and to upgrade these cartoon models into real models. For example, it is in our 
view crucial to include in a consistent way the dynamics of trust - and the possibility of sudden trust breakdown, 
along the lines sketched above - in macro-economic models, in su ch a way that extreme events and crises can be 



accounted for. This is an ambitious project, which is only starting. 161 [ 

Cartoon models are useful on many counts. But in the present case, did we learn anything that was not put by hand 
in the model? I believe so, at least insofar as precise, well defined questions with empirical answers can be asked. 
For example, the robust signature of RFiM-like herding effects reported in section IV Al or the logarithmic spatial 
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correlation of voting patterns (section fV CI) are unexpected predictions of quantitative models. Still, one would like to 
go beyond the level of plausibility and metaphors, and reach indisputable quantitative agreement between at least one 
non-trivial prediction of these self-referential feedback models and some empirical data. For example, the power-law 
distribution of avalanche sizes predicted by the rfim is tantalizing and might enable one to understand why burst of 
activities and power-laws appear in e.g. the distribution of returns in financial markets. But there is not much more, 
at this stage, than wishful thinking and hand-waving. The possible existence of a condensation phase transition in 
multiple choice situations, of the type described in sections IV Dl and IVII Al seems to be a highly relevant question, 
again amenable to precise empirical investigations (see e.g. (67L ilO^). 

In my opinion, the most striking example of surprises (i.e. results that are not anticipated before a model is solved) 
encountered in this review is the clear quantitative demonstration, within a Schelling-type segregation model [96j , 
that Adam Smith's invisible hand can badly fail at solving simple coordination problems - see section IVI Bl I have 
also tried to i nsist on the issue of time-scales, that are generically extremely long in heterogenous interacting sytems 
(see e.g. |l34| ). This may prevent a socially optimal equilibrium to be reached, unless some kind of catalysis nudges 
the system through an otherwise unsurmountable "barrier" - see section IlII Bl As a theoretical question, the study of 
detailed-balance violating decision rules (which must be the rule rather than the exception) seems to me well worth 
the effort, in particular to decide whether conclusions based on detailed-balance rules are indeed robust and generic. 
Of course, a unifying description of non-equilibrium stationary states is a major challenge in statistical physics as 
well! In any case, I hope the reader will be convinced that statistical physics can contribute to a better understanding 
of collective socio-economic phenomena, and that these theoretical ideas could one day be ripe enough to have a real 
impact on policy issues. 
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[140] The distinction between the two interpretations will be discussed again in section IVl Al 

[141] It is interesting to notice that in Kirman's ant recruitment model [3, one rather has P{N+ N+ + 1) = P{N+ 

N+ — 1) = IJ,(I>{1 — 4>) because change of opinions are supposed to happen during two-body "encounters" where one of the 
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of a one-dimensional random walk. Indeed, the critical branching process can be mapped onto that problem, see [g^] for 

more elaborations on this point. 
[146] This universality was also noted and emphasized in [s^. |40|]. 

[147] The RFIM is actually well known to have a rather wide critical region, as emphasized in [sj]. 

[148] We emphasize again here that if J = 0, or if the variations were due to different speeds v in different countries, one should 
observe k = 1. 

[149] see also [t^ . [tJ for experiments on information cascades and 78] for some empirical evidence of tipping points in urban 
segregation phenomena. We also refer the reader to [79| for a review on "econometric" approaches to social phenomena. 

[150] I personally know people who cashed their money just after Lehman's default, and left their bank with a plastic bag full 
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[151] In the following expression, (...) denotes an averaging over all positions R. 

[152] Note that here we neglect the immediate influence of others in the actual decision to vote, i.e. the JSiSj interaction term. 
See the discussion of this point in [g^ . 

Taking the relevant inter-town distance to be ~ 10 km and the time for opinions to get closer to be a few months, one 
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can estimate D to be of the order of magnitude of a few hundreds km^ per year. 



Other arguments, inspired by statistical physics, could go as follows [9^ : imagine a choice consisting in two sub-choices 
concerning issues in disjoint sets A, B. These two sub-choices are furthermore assumed to be independent, i.e. the choice of 
an alternative a in ^ does not impact the utility of any of the alternatives h in B, and vice- versa. The utilities are therefore 
additive in that case: ?7q®6 = Ua -\- Ub- Since the sub-choices are independent, one should also have Pai^b = Pa x Pt- 
Looking for probabilities that depend on the total utility of the choice therefore selects the exponential form. One could 
also try to replicate the usual canonical construction of the Boltzmann weight, by arguing that a choice is never in 
isolation but interacts with many other choices, while the agent is only interested in the total utility. Sub-optimal choices 
(corresponding to a finite /3) are allowed because they only have a small contribution to the total utility. However, these 
arguments are somewhat ad-hoc. 
[155] One should of course keep in mind that the time needed to reach full equilibrium might be very large, or even infinite if 

there is a phase transition and ergodicity breaking - something that requires the number of states to be infinite. 
[156] We neglect here an entropic term, which is small when /3 — >■ oo, see [9^ for details. 

[157] More rigorous work is needed on this whole issue; in particular on whether the existence of a transition in the Ising model 

is sufficient to ensure loyalty formation. 
[158] Interesting situations could occur when these two effects are in conflict, for example when overcrowding or saturation 

effects prevent full condensation. 
[159] Whereas the impact of individual orders in strongly concave, the impact of the aggregate order imbalance is, to a good 

approximation, linear. See 55, 56] for a long discussion of this issue. 

[160] In fact, the rfim has also been related to the physics of ear thqu akes and material creep, see [H, Il2l[ ]. It would be 
interesting to make a detailed connection with the work of ref. [l20 



[161] This is one of the ambitions of the CRISIS project, see: http://www.crisis-economics.eu/home 



